ACQUISITION  OF  CAUSAL  INFORMATION  FROM  MEDICAL 
DOCUMENTS  IN  FRENCH  FOR  EXPERTS 


Wonsil  Park,  Maryvonne  Abraham 
ITI,  Ecole  Nationale  Superieure  des  Telecommunications  de  Bretagne,  France 
LaLIC,  University  of  Paris4-Sorbonne 


Abstract  -  This  paper  discusses  how  to  select  causal 
information  from  medical  documents,  in  particular 
the  information  structured  by  verbal  locutions  in 
French.  Causal  relations  are  represented  not  only  by 
verbs,  X  CAUSES  Y,  but  also  by  other  different 
information,  for  instance  a  temporal  order  between 
cause  and  effect,  different  meanings  of  the  effect,  and 
so  on.  With  an  example,  we  propose  how  to  categorize 
causal  relations  in  order  to  obtain  much  more  detailed 
information  from  medical  documents.  This  method  is 
useful  to  experts  to  increase  knowledge  data  base,  and 
it  could  help  students  in  Medicine  to  understand  and 
organize  their  tasks. 
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I.  INTRODUCTION 

The  interest  of  our  research  is  obtaining  maximal 
information  from  documents  that  we  get  everyday. 
Our  research  is  constituted  of  two  phases: 
extracting  information  on  causal  relations  from 
medical  documents  in  French  and  representing 
various  consequences  of  relation  cause-effect. 
French  language  shows  a  number  of  expressions  to 
describe  causal  relations:  simply  in  temporal  order, 
connectors  like  parce  que  (because),  car(since), 
adverbs  like  done  (so),  alors  (then),  or  verbal 
locutions  like  provoquer  (produce),  pousser  a 
(push),  declencher  (activate),  etc.  A  causal  relation 
is  not  always  uniform  as  “  X  causes  Y  ”,  Natural 
languages  are  very  rich  in  semantic  representation. 
A  verb  supplies  to  us  not  only  a  causal  relation 
between  X  and  Y,  but  also  how  they  react  each 
other,  and  even  the  consequence  generated  from 
this  relation:  for  instance  “A  is  a  cause  of 
development  of  Y  ”,  “A  is  a  cause  of  generation  of 
Y\  “A  is  an  immediate  cause  of  Y  ”,  etc.  We  have 
approached  to  this  aim  by  analyzing  French  verbal 
meanings,  since  a  verb  does  not  have  a  unique 
meaning  and  each  meaning  relies  on  relations 
between  arguments  and  semantic  invariant.  This 
research  will  complete  and  detail  causal  relations. 

II.  CONTEXTUAL  EXPLORATION  SYSTEM 

A  text  understanding  processing  is  constructed  of 
several  procedures.  The  systems  that  we  present  in 


this  paper  are  developed  by  the  Laboratory  LaLIC1. 
The  contextual  analyzing  technique,  developed 
from  theoretical  researches  of  LaLIC,  allows  us  to 
search  the  semantic  contents  of  texts.  We  will  not 
discuss  in  detail  about  these  systems  and  we  will  be 
content  with  describing  them2.  The  architecture  of 
plat-form  FilText  (fig.l)  shows  the  procedure  from 
documents  to  three  different  finalities,  extracting 
information,  summarizing  texts  and  representing 
information  in  graphs. 

FilText  is  a  semantic  filtering  system  oriented  to 
users.  Its  contextual  exploration  motor  calls  and 
executes  the  rules  since  an  analyzed  text.  It  is  a 
conceptual  model  relying  on  the  structuration  of 
texts,  mainly  grammatical  structure,  because  the 
grammar  encodes  a  part  of  semantics. 

The  plat  form  FilText  is  general  in  the  following 
meanings:  the  data  base  of  linguistic  markers  and 
rules  can  be  increased  so  that  the  quality  of 
semantic  filtering  becomes  better;  it  is  possible  to 
add  other  modules  to  the  modules  already 
constructed  without  any  change 

Our  research  is  especially  oriented  to  extracting 
information  of  causal  relations  from  medical  texts. 
Characteristics  of  the  medical  domain  are  more 
complicated  than  the  mechanics.  Because  there  are 
much  more  unknown  factors  which  might  influence 
on  results.  Symptoms,  diagnostics,  medication,  etc. 
are  different  and  various  according  to  a  patient’s 
conditions  -  personal  (age,  genetic  factors,  etc.)  and 
environmental  (aliment,  climate,  pollution,  etc.). 
Therefore,  there  should  be  some  unhealed  factors 
which  could  lead  to  totally  unexpected  results.  A 
causal  structure  in  the  medical  domain  stays 
hypothetical  until  we  obtain  a  result. 

III.  LINGUISTIC  REPRESENTATION  OF 
CAUSAL  RELATIONS 

Extracting  causal  information  from  texts  obliges  us 
several  fundamental  tasks,  which  consist  to 
linguistic  and  philosophic  researches.  Above  all,  we 


1  Langage,  Logique,  Informatique,  Cognition.  We  can  consult 
more  information  about  these  system  in  the  website 
www.lalic.paris4.sorbonne.fr/html/contexte.html 

2  For  more  information,  see  Descles  et  alii.  (1996),  S.  Ben- 
Hazez,  J.-L.  Minel  (2000)  and  G.  Crispino,  S.  Ben-Hazez,  J.-L. 
Minel  (1999).  It  is  also  available  in  the  website  of  LaLIC, 
www.  lalic  .paris4 .  sorbonne.  fr 
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should  understand  what  the  causality  means  and 
how  a  causal  relation  is  organized.  First  of  all,  the 
causality  is  an  interpretation  of  world  that  we 
perceive  and  a  relation  between  events,  a 
reorganized  relation  in  a  coherent  way  in  order  to 
give  an  explanation.  We  give  a  first  condition  to 
find  a  causal  relation3. 

Condition:  X  is  a  cause  of  Y,  if  Y  is  not 
chronologically  situated  before  X  and  if  X 
and  Y  can  be  described  separately. 

However,  this  condition  is  not  enough  to  satisfy  all 
descriptions  of  the  cause-effect  relation.  Natural 
language  demonstrates  many  ways  to  describe 
causal  relations  between  phenomena.  In  French 
language,  we  can  describe  them  simply  in  temporal 
order  and  reorganize  with  the  help  of  linguistic 
elements:  connectors  ( done  (so),  alors  (then),  si  - 
alors  (if  -  then),  parce  que  (because),  car  (since), 
puisque  (because),...),  verbs  (provoquer  (produce), 
declencher  (activate),  causer  (cause),  resulter  de 
(result  from),  ...),  or  we  can  emphasize  and  detail 
causal  relations:  adverbs  ( X  provoque  Y 
immediatement,  lentement , . . .  “X  produces  Y 
immediately,  slowly...”),  negation  (X  ne  declenche 
pas  Y,  “X  does  not  activate  (or  start)  Y”  ),  modality 
(X  peut  provoquer  Y,  “X  can  produce  Y”),  tenses 
(X provoquerait  Y,  “X  would  produce  Y”.)...  These 
linguistic  elements  influence  on  modification  of 
causal  relations. 

The  causal  relations  represented  by  verbal  locutions 
are  explained  in  three  categories:  the  static  relation , 
which  explains  relations  between  cause  and  effect, 
for  instance  the  identification,  X  est  la  cause  de  Y 
(X  is  the  cause  of  Y);  the  kinematic  relation,  which 
encodes  spatio-temporal  relations,  for  instance  the 
origin,  Y  vient  de  X  (Y  comes  from  X);  the  dynamic 
relation,  describing  the  role  of  cause  and  the 
reaction  of  effect,  for  instance  the  starting,  X 
declenche  Y  (X  activates  Y). 

We  start  by  analyzing  the  Semantics  of  verbs  which 
organize  the  dynamic  relation.  Because  the  cause  is 
not  considered  as  a  simple  origin,  but  it  influences 
on  the  construction  of  effect.  Certain  verbs  describe 
clearly  causal  relation  and  others  need  contextual 
information.  In  order  to  extract  causal  information 
from  texts,  we  need  more  information  than  isolated 
words  of  the  text.  We  can  organize  causal 
temporality  as  a  primal  criterion.  Because  effect 
cannot  happen  before  Cause. 


J  It  is  very  hard  to  define  the  causality  (causal  relation)  in  one 
word.  Because  there  are  so  many  various  descriptions  and 
definitions  about  this  complex  notion.  A.  Jackiewicz  discussed 
this  problem  in  her  Ph.D  thesis  [A.Jackiewicz,  1998]. 


IV.  CHRONOLOGICAL  RELATION  AND 
EFFECT  MODALITY  IN  CAUSALITY 

First  of  all,  from  the  semantic  analysis  of  French 
verbs,  which  organize  dynamic  relations,  we  have 
obtained  three  chronological  relations  between 
Cause  and  Effect. 

1.  Immediate:  we  do  not  perceive  any 
important  temporal  intervention  or  time  length 
between  Cause  and  Effect:  creer,  provoquer, 
produire,  declencher,  causer,  ... 

2.  Non  immediate:  A  time  length  between 
Cause  and  Effect  is  more  important,  entrainer, 
conduire  a,  aboutir  a,  ... 

3.  Correlative:  Cause  and  Effect  are  not 
successive  as  immediate  and  non  immediate. 
They  move  progressively  on  a  consequent 
period. pousser-d,  augmenter,  ... 

Effect  modality  represents  in  particular  the  property 
of  effect.  It  means  how  the  effect  reacts  to  the 
cause. 

1.  Starting  effect:  Cause  activates  Effect.  It  can 
allow  to  observe  successive  consequences. 
declencher,  creer,  provoquer, 

2.  Stopping  effect:  Cause  stops  activity  of 
Effect,  bloquer,  annuler,  arreter,  ... 

3.  Increasing  effect:  activity  of  Effect  increases. 
pousser-d,  augmenter,  accelerer,  ameliorer,  . . . 

4.  Decreasing  effect:  activity  of  Effect 
decreases,  diminuer,  ralentir,  reduire,  ... 

5.  Maintaining  effect:  activity  state  of  effect  is 
stable  without  any  modification,  maintenir, 
conserver,  geler,  immobiliser,  ... 

A  verb  can  describe  a  chronological  relation  and 
indicate  an  effect  modality.  A  verb,  such  entrainer, 
describes  a  non  immediate  chronological  relation 
between  cause  and  effect  with  activating  the  effect. 
We  construct  the  three  categories  of  relations 
between  the  cause  and  the  effect  in  the  following 
board. 


Semantic  relations 

•  Static  relations 

•  Cinematic  relations 

•  Dynamic  relations 

Chronological 
relations  (temporal 
distance) 

•  Immediate 

•  Non  immediate 

•  Correlative 

Effect  modalities 

•  Starting  effect 

•  Stopping  effect 

•  Increasing  effect 

•  Decreasing  effect 

•  Maintaining  effect 

The  causal  relations,  obtained  from  an  analyzed 
text,  will  be  represented  in  the  following  figure. 


Xi,  Xz,  ...  Xn  are  causes  of  y 


Immediate  effect: 


Non  immediate  effect:  . *■ 


Decreasing  effect:  - >■ 


Increasing  effect: 


S> 


Fig.  1  -  Different  Causa I  relations  represented  in  graphs 


V.  EXTRACTION  AND  REPRESENTATION  OF 
CAUSAL  RELATIONS  FROM  A  TEXT: 

AN  EXAMPLE 

Now,  we  show  an  example4,  a  medical  text  written 
in  French.  It  talks  about  various  causes  which 
produce  the  problem  “dysosmy”.  We  have  marked 
“dysosmie(s)”,  considered  as  the  effect.  The 
underlined  words  designate  causal  relations;  La  ... 
cause  des  dysosmies  est  ....  Cette  dysosmies  est  due 
a...,  ces  dysosmies  induites  par...,  etc.  In  the  first 
step,  we  extract  the  sentences  which  contain  the 
word  “dysosmie(s)”.  And  then,  we  analyze  verbal 
locutions. 

We  extract  the  sentences  which  contain  the 
information  of  “dysosmie(s)” . 

•  Static  relations 

identification:  La  ...  cause  des  dysosmies  est  ... 
(The  ...  cause  of  dysosmies  is. . .) 
inclusion:  Cette  dysosmie  est  due  a...  (This 
dysosmy  is  due  to. . .) 

non  immediate:  la  dysosmie  secondaire  ct...  (the 
dysosmy  following  to. . .) 

•  Dynamic  relations 

correlative:  Plus  ....  plus...  augmente  (The  more  ..., 
the  more. . .  increases) 

increasing  :  Plus  ...,  plus...  augmente  (The  more 
. . the  more. . .  increases) 

We  represent  the  causes  producing  the  problem 
“ dysosmie(s )"  in  the  following  graphs. 


DYSOSMIES  DE  PERCEPTION 

La  premiere  cause  (30  %  des  dysosmies)  des  dysosmies  de 

perception  est  la  dysosmie  secondaire  a  une  rhinite  aigue.  La 
patiente,  une  femme  de  plus  de  60  ans  dans  70  %  des  cas,  constate 
une  anosmie  durant  une  banale  rhinite.  Mais,  fait  inhabituel,  lorsque 
la  rhinite  aigue  guerit  apres  une  evolution  usuelle  d'une  quinzaine 
de  jours,  la  perte  de  l'odorat  persiste.  Dans  60  %  des  cas,  une 
parosmie  s'installe,  soit  immediatement,  soit  apres  quelques  mois 
devolution.  Cette  dysosmie  est  due  a  la  destruction  du 
neuroepithelium  olfactif  par  les  virus  de  la  rhinite  aigue.  C'est  un 
diagnostic  d'elimination  :  une  maladie  chronique  des  cavites  naso- 
sinusiennes  peut  debuter  de  la  sorte.  Un  examen 
tomodensitometrique  de  la  face  doit  etre  pratique  ;  il  est  normal 
dans  ces  dysosmies  induites  par  une  rhinite.  Moins  d'un  tiers  des 
patientes  ont  une  amelioration  spontanee  de  leur  odorat  dans  les 
mois  et  les  annees  suivant  l'episode  rhinitique.  II  n'existe  aucun 
traitement  preventif  ni  curatif. 

La  deuxieme  cause  des  dysosmies  est  le  traumatisme  cranien. 
Le  choc  est  le  plus  souvent  occipital,  plus  rarement  frontal.  Plus 
l'intensite  du  choc  est  grande,  plus  la  frequence  des  dysosmies 
augmente.  L'association  d'un  traumatisme  facial  et  d'un 
traumatisme  cranien  augmente  considerablement  le  risque  de 
dysosmie.  La  physiopathologie  de  cette  dysosmie  n'est  pas 
univoque  :  lesions  du  neuroepithelium  olfactif,  section  des  neurones 
olfactifs  primaires  au  niveau  de  la  lame  criblee,  contusion  des 
centres  olfactifs.  Une  amelioration  peut  survenir  durant  les 
premieres  semaines  apres  le  traumatisme  (par  diminution  des 
lesions  cerebrales  frequentes).  Ces  lesions  olfactives  posent  le 
probleme  de  la  reparation  juridique  du  dommage  corporel.  Si 
l'imputabilite  du  trouble  est  admise,  le  bareme  d'invalidite  de  ces 
sequelles  en  droit  commun  propose  un  taux  de  2  %  ou  3  %  en  cas 
d'hyposmie,  de  5  %  en  cas  d'anosmie  avec  une  maj oration  de  1  % 
en  cas  de  parosmie.  II  n'existe  aucun  traitement. 

La  troisieme  cause  des  dysosmies  est  le  vieillissement.  La 
moitie  des  sujets  ages  de  65  ans  a  80  ans  souffre  d'un  important 
deficit  olfactif.  Apres  80  ans,  75  %  des  sujets  sont  anosmiques  ou 
fortement  hyposmiques.  II  semblerait  que  ce  trouble  de  l'odorat,  a 
l'origine  d'une  forte  hypogueusie,  soit  responsable  du  desinteret  des 
personnes  agees  pour  l'alimentation.  Le  deficit  olfactif  touche  aussi 
bien  la  sensibilite  olfactive  (hyposmie,  voire  anosmie)  que  la 
perception  de  l'intensite  d'une  odeur,  l'identification  des  molecules 
odorantes  et  la  memorisation  de  nouvelles  odeurs.  L'ensemble  du 
systeme  olfactif  vieillit,  du  neuroepithelium  jusqu'au  cortex.  II 
n'existe  aucun  traitement. 


4  This  text  is  a  part  of  an  article  published  in 
www.  legenerali  sete  .presse .  fr 


Fig.2  -  Representation  in  graphs:  various  causes  producing  “dysosmie  de  perception  ” 


This  representation  shows  the  causal  relations 
strictly  related  to  the  problem  “dysosmy”.  But  this 
text  contains  other  information:  1)  problems  related 
to  other  pathologies  (“ anosmie ”,  “ hyposmie ”, 

“ parosmie ”),  2)  statistic  results  (dans  70  %  des  cas, 
30  %  des  dysosmies,  la  moitie  des  sujets  ages  de  65 
ans  a  80  ans,..) 

VI.  CONCLUSION 

In  the  medical  domain,  the  causality  consists  to 
search  not  only  general  causes  with  a  certain 
number  of  statistics,  but  also  exceptional  causes 
which  are  rare,  but  could  be  very  important  in 
certain  cases. 

Filtering  information  in  a  context  requires  a 
vigorous  work,  but  it  should  be  relatively  dependant 
of  natural  languages.  The  causal  relations  are 
complex  and  even  a  simple  sentence  can  supply 
several  different  relations  between  the  cause  and 
the  effect.  It  is  evident  that  this  research  of  verbal 
semantic  analysis  is  only  a  part  of  the  extraction  of 
medical  causal  information.  To  obtain  the  maximal 
information  with  completing  the  knowledge  data 
base,  it  is  necessary  to  dig  in  the  Semantics  of  texts. 
In  a  context  of  very  large  internet  information,  our 
method  can  help  to  classify  the  knowledge  data 
concerning  specific  domains  in  Medicine.  For 
experts,  it  is  useful  to  increase  knowledge  data 
base,  and  it  could  help  non  experts,  students, 
administrators  of  hospital,  and  so  on,  to  understand 
and  organize  their  tasks. 
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